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(57) Abstract: A syramefric key cryptographic method is provided for short operations. The method includea batching a plurality of 
operation parameters (1503), and performing an operation according to a corresponding operation parameter (1505). The symmetric 
key cryptographic method is a Data Encryption Standard (DES) method. The short operations can be less than about 80 bytes. The 
short operatioDS can be between 8 and 80 bytes. The method includes reading the batched parametcis from a dynamic random i 
memory (1504), and transmitting each operation through a DES engine accorfing to the opeiatioiis parameter (1505). 
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IMPROVING DES HARDWARE THROUGHPUT FOR SHORT OPERATIONS 

This a non-provisional application claiming the benefit of provisional ^plication 
serial No. 60/201,002, filed May 1, 2000, 

The present invention relates to cryptographic support, and more particularly to . 
cryptographic support for short operations. 
Background Art 

Data Encryption Standard (DES) is a widely-used method of data encryption using 
10 private keys. There are 72 quadrillion or more possible encryption keys under the DES that 
can be used for protecting packets between parties over electronic networks. For each packet 
or message, a key is chosen at random. Like other symmetric key ciyptographic methods, 
both the sender and receiver need to know and use the same private key. 

DBS applies a 56-bit key to each 64-bit block of data. The process can run several 
15 modes and includes 16 rounds of operations. Although this is cotisidered strong encryption, 
tnany compames use triple-DES (XDES), which applies ^ee keys in succession to each 
packet. 

DES originated at IBM in 1977 and was adopted by the U.S. Department of Defense. 
It is spedfied in the ANSI X3.92 and X3.106 standards and in the Federal 
20 Processing Standards (FIPS) 46 and 81 standards. 

Typically, cryptographic methods focus on large packets (greater than about 80 
b3^es). However, when a DES system is used for smaller packets, the performance may drop 
by an order of magnitude. 



I 
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Therefore a need exists for a system and method of cryptographic siq)port for DES 
aperalious which has high throughput for long (>80 bytes) and shorter packets. 
Disclosure of the Invention 

According to an embodiment of the present invention, a symmetric key cryptographic 
S method is provided for short operations. The method includes batching a plurality of 

operation parameters, and performing an operation according to a corresponding operation 
parameter. The symmetric key cryptographic method is a Data Encryption Standard (DES) 
method The short operations can be less than about 80 bytes. The short operations can be 
between 8 and 80 bytes. 

10 The method includes batching the plurality of operation parameters and a plurality of 

DBS operation into a single request, calling DBS for each operation in the request, and 
perfomiing DBS for each, operation separately according to the corresponding operation 
parameter. 

The method further includes batching the plurality of operation parameters and a 
15 plurality of DB!S operations into a single request, calling DBS for the batched operations, and 
performing DBS for each operation separately according to the corresponding operation 
parameter. Each request is perfixmed with a chip reset, a key and an initialization vector. 
Calling the DBS for the batched operations fiirther cdnq>rises switching a context for the 
batched operations. Hie context switch is between an application layer and a system software 
20 layer. 

The method includes reading the batched parameters from a dynamic random access 
memory, and transmitting each operation through a DES engine according to the operations 
parameter. 



2 
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According to an embodiment of the present invention, a method is provided for 
improved DBS short operation throughput. The method includes batching a plurality of 
operation parameters, each operation parameter corresponding to an operation^ reading the 
batched operation parameters into a dynamic random access memory, and transmitting-each 
5 operation through a DBS engine according to the operations parameter. The DBS is 

extemal-to-extemal and an output for each operation is transmitted separately. The short 
operation can be less than about 80 bytes. The short operation can be between 8 and 80 
bytes. 

According to an embodiment of the present invention^ a symmetric key cryptographic 
10 method is provided for operations between about 8 and about 80 bytes in length. The method 
includes providing a key index to an engine, and pumping the operations through the engine 
in bulk wherein a central processing unit does not handle the bytes. The engme is a DBS 
engine. 

The method includes resetting an engine chip for an operation^ reading an 
15 initialization vector, and loading fb& initialization vector into the engine chip. The method 
further includes determining a key from the key index, loading the key into tiie engine chip, 
and reading a data length for the operation. 

The method includes transmitting the data length through an Input channel into the 
engine chip» and tran^nitting the data length through an Output channel. The channels are 
20 FIFOs. 

Brief Description of Drawings 

Preferred embodiments of the present invention will be described below in more 
, detail, with reference to the accompanying drawings: 
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Fig, 1 is a diagram of the DES architecture according to an embodiment of the present 
invention; 

Fig. 2 is anotter diagram of the DES architecture according to. an embodiment of the 
present invention; 

5 Fig. 3 is still another diagram of the DES architecture accordmg to an embodiment of 

the present invention; 

Fig. 4, is yet another diagram of the DES architecture according to an embodiment of 
the present invention; 

Fig. 5 is a diagram of the FIFO structure supporting DBS/TDES with a coprocessor 
10 according to an embodiment of the present iuvention; 

Fig. 6 is another diagram of the FIFO structure supporting DES/TDES with a 
coprocessor according to an embodiment of the present invention; 

Fig. 7 is still another diagram of the FIFO structure supporting DES/TDES with a 
coprocessor according to an embodiment of the present invention; 
15 Fig. 8 is yet another diagram of the FffO structure supporting DES/^ 

coprocessor according to an embodiment of the present invention; 

Fig. 9 is a furflier diagram of the FIFO structure supporting DES/TDES with a 
coprocessor according to an embodiment of the present invention; 

Fig. 10 is a diagram of the FIFO stmcture supporting DES/TDES with a coprocessor 
20 according to an embodiment of the present invention; 

Fig, 1 1 is a flow diagram of an application handling two operations as separate 
sccRequests according to the prior art; 

Fig. 12 is a flow diagram illiistrating a batched host-card interaction according to an 
embodiment of the present invention; 

4 
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Fig. 13 is a flow diagram of multiple operations batched into a single call according 
to an embodiment of the present invention; 

Fig. 14 is a flow diagram of a method which reduces data transfers for each operation, 
according to an embodiment of the present invention; 
5 Fig. 15 is a flow diagram- of a method which batches parameters for all operations 

into a block according to an embodiment of the present invention; and 

Fig. 16 is a graph illustrating DES speeds for various embodunents of the present 
• invCTtion. 

Best Mode for Carrying Out the Invention 

10 The present invention provides a system and mefliod for cryptographic support which 

has high throughput for long and short DES operations. According to an embodiment of the 
present invention, the s>^tem includes a multi-chip embedded module, packaged in a : 
Peripheral Component Interconnect (PCI) cant In .addition to cryptographic hardware and 
circuitry for tamper detection and response, a general-purpose computing enviromnent is 

1 5 provided including a central processing unit, and executing software stored in ROM and/or 
Flash memory . 

Referring to Fig. 1, the multiple-layer software architecture of the cUent 101 and the 
host 105 is shown. The client-side includes foundational security control in Layers 0 and 1 
102, a supervisor-level software system in Layer 2 103, and a user-level software application 
20 in Layer 3 104. Layer 2 103 supports application development Within Layer 2 103, a kernel 
provides tiie operating system abstractions of multiple procei^ses and address spaces; these 
abstractions support independent managers, which handle cryptographic hardware and other 
input/output (I/O) on the bottom, and provide higher-level application program interfeces 
(APIs) to the Layer 3 application 104. An API is the specific method prescribed by a 
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computer or by another program by which a programmer writing an application program can 
make requests of the operating, system or another application. Typically, the Layer 3 
application 104 in turn provides an abstraction of its own API to a host-^side application 107. 
The host-side 105 includes a device driver 106 and a host application 107. According 
5 to Fig. 2, for the Layer 3 application 104 to use a service provided by the card-side 

application, the host-side application 107 issues a call to the host-side device driver 106. The 
device driver 105 opens an sccRequest 108 to the Layer 2 system 103 on the device. 
Layer 2 103 informs the Layer 3 application 104 resident on the device of the existence of 
the request, and the x>arameters the host sent along with the request. 

10 According to Figs. 3 and 4, the Layer 3 application 104 handles the host application's 

request for service, for example, it can dfrect Layer 2 103 to transfer data 109 to the device 
driver 106 and perform the needed cryptographic operations. The Layer 3 ai>plication 104 
closes out the sccRequest 110 and sends the ou^utbaok 111 to the host application 107* 
According to an embodiment of the present invention, a device for fast cryptography 

15 is provided. The device includes a coprocessor having a central processing unit (CPU), at 
least two levels of internal software and at least three data paths. The software levels can 
include an operation system or kemal level and an application leveL The data paths can 
include an extemal to internal memory and/or CPU path, an internal memory and/or CPU to 
a syminetric engiae path, and a channel between the extemal system and the symmetric 

20 engine. The channel can be a first-in first-out (FIFO). According to an embodiment of the 
present invention, the device includes a FIFO state machine. The FIFO state machine 
stmctnre transports or drives data into and out of the method engine. 

It should be noted that while the present invention is presented in terms of a 
symmetric cryptographic function (e.g., DBS), the invention contemplates any parameterized 
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function on variable length datai Thus, DBS is provided as an example of an embodiment of 
the present invention and given* the teachings of the present invention provided herein, one of 
ordinary skill in the related art will be able to contemplate these and similar implementations 
or configurations of the present rnvention. 
5 Referring to Fig, 5, the FIFO structure works wifli the DES/TDES engine 500. The 

present invention is described according to an IBM 4758 coprocessor, specifically Models 
002/023 PCI cryptographic coprocessors, however, given the teaching of the present 
invention provided herein, one of ordinary skiU in the related art will be able to contemplate 
these and similar implementations or configurations* . . 

10 In Model 2 hardware^ the FIFO structure also supports fest Secure Hash Algorithm 1 

(SHA-1); though the structure may be applied to any method engine. 

For both input and output, two pairs, of FIFOs 501-504, a PCI FIFO pair 501-502 and 
an internal FIFO pair 503-504 are provided for external and internal transfer, respectively, as 
weU as a Direct Memory Access (DMA) controller 505-506 for CPU-fi:^e transfer into and 

15 out of internal dynamic random access memory (DRAM) 507. 

The internal CPU 508 selects which data paths to activate, and what key, initialization 
vector (IV)^ and other operational parameters the DBS engine 500 may use, via control 
rejgisters (not shown). The IV is generated by a random number gaierator, typically included 
in the Layer 2 system, and combined with liie unenoryted text and the key. The key is a 

20 variable value applied to a bloct of unencrypted text to produce encrypted text. 

Configurations of the DES engine 500 include bulk extemal-to-extemal DBS (shown 
in Fig. 8), bulk intemal-to-intemal DES (output DMA 506 to intemal input FIFO 503 to DES 
500, then back through the Internal Output FIFO 504 and PCI Output FIFO 502), and DMA 
transfer (e.g., PCI input FIFO 501 to intemal input FIFO 503 to ixxpxA DMA 505 and from the 

7 
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Output DMA Controller 506 to "the Internal Output FIFO 504 and to the PCI Output FIFO 
502). Further, the DBS hardware can be configured in a bypass mode in which the 
commercial Layer 2 system does not use the hardware. 

One constraint on the system is that either both internal FIFO-DBS paths need be 
5 selected (bulk mode), or neither is to be selected. Another constrdnt is that the FIFO 

configurations cannot be altered until data transfer is paused, and the state machine driving 
the FIFOs will transfer data asynchronously until resoxirces are exhausted 

The internal CPU 508 can configure the FIFO hardware to support card applications 
in various ways. For example. Fig. 6 depicts a configuration in which the FIFOs bring data 
10 into the card via the DMA, such as when die host application opens up a sccReques t to 
the card application. Data passes from the Pd Input FIFO 501 to the Internal Input FIFO 503 
via 601, to die Input DMA Controller 505 via 602, to the DRAM 507 via 603 and 604. 

Referring to Fig. 7 depictmg a DES request, the card may transfer the operational 
parameters from the DRAM 507 into the DES chip 500. The internal CPU 508 loading 
15 operational parameters into the .DBS chip 500 from the DRAM 507 via lines 701-703. 

According to Fig. 8, if tlie DES request is for extemal-to-extemal DES, the card will 
configure the FIFOs to bring the data in from the host, through the DES chip 500 and back to 
the host The CPU 508 can configure the FIFOs 50 1-504 to stream data fi-om the host, 
through the DBS chip and back to the host via lines 801-804. 
20 Additionally, if the DBS request is for kitemal-to-intemal DES and is deteraiiaed to 

be too short foir DMA, the card may manually push the data bytes through. The CPU 508 can 
drive data from the DRAM 507 through the DES/TDES engine via programmed I/O and 
lines 901-904. 

8. 
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As depicted ia Fig. 10, wten the sccRequest is complete, the card may send the 
results back to the host via DMA. The internal CPU 508 can configure the FIFOs to send 

data from the DRAM 507 back to the host via the DMA and lines 1001-1004. 

■ 't 

The present invention ijroposes methods for increasing the throughput of short DES 
5 operations. The methods used for evaluating the present invention included, DES operations 
including cipher block chaining (CBC) encrypt and CBC-decrypt, with data sizes distributed 
uniformly at random between 8 and SO bytes. Chaining is a me^od which depends the 
decryption of a block of cipher text on all preceding blocks. The IVs and keys changed with 
each operation; the keys are tripple-DES (TDES) encrypted with a master key stored inside 

1 0 the device. Encrypted k^s, IVs and other operational parameters are sent in with each 

operation, but are not counted as part of the data throughput. Although the keys may change 
with each operation, the total number of keys is small, relative to the.number of requests. 
Referring to Fig. 16, the speeds obtained for DES operations are shown for various 
embodiments of the present invention. Using Model 1 hardware a speed indicated by 1601 

15 was achieved. 

A baseline implementation was established using a Model 2 prototype for the 
following embodiments. According to Fig. 1 1, the host application handles each operation 
1 101-1102 as a separate sccRequest 1 103-1 104 with Programmed Input/Output (PIO) 
DES. Tlie ic^lementation inclufles the host application which generates sequences of 

20 short-DES requests (cipher key, IV, data) and the card-side application. The card-side 
application catches each request, unpacks the key, sends the data, key, and IV to the DES 
engiue, and sends the results back to the host. Keys were randomly chosen over a set of 
cipher keys. Caching keys inside the card reduced, the extra TDES key decryption step and 
increased the speed 1602. 
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! • . -I 

: According to an embodiment of the present invention, fhe short-DES performance I 

i 

I can be enhanced by reducing the host-card interaction. Referring to Fig. 12, this includes 

j 

batching a large sequence of short-DES requests into one sccRequest 1201. The card-side 

application was modified accordingly to receive the sequence in one step, process each | 

j 

5 operation 1202-1205, and send the concatenated output back to the host in one step 1206. > 
i The Layer 3 application calls DE:S for each operation 1202 and 1204. Layer 2 perfbnns the 

I DES for each operation separately 1203 and 1205, Speeds obtained for the benchmark data 

j 

above where between about 18 to 23 Idlobyt^second and up to 40 kilobytes/second with 
key catching 1603. 

10 According to an embodiment of the present invention, by eliminating the DES chip 

reset for each operation the short-DES performance may be increased 1 604. By generating a 
sequence of short-DBS operation requests that all use one key, one direction (decrypt or 
encrypt), and IVs of zero (although the IVs may be arbitrary), a speed of about 360 
kilobytes/second can be achieved. The card-side application receives the op^ation sequence 

15 and sends the operation sequence to &e Layer 2 system. In Layer 2, a modified DES 

Manager (the component controlling the DES. hardware) sets up the chip with the key and a^ 
IV of zero, and trmsmits the data through the chip. The end of each operation, the DES 
Manager performs an exclusive-or (XOR) to break the chaining. For example, for encryption, 
the software manually XOR's the last block of cipher text from the previous operation with 

20 . the first block of plain text for the operation, in order to cancel out the XOR that the chip 
would do. 

According to the batching method, besides reducing the number of chip resets, the 
number of context switches between ^ Layer 3 and Layer 2 is reduced fi^m 0(n) to 0(1), 
where n is the number of operaitions in the batch. Referring to Fig, 13, according to another 

10 
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! embodiment of the present invention, by using the multi-key, non-zero-IV setup (results 

I shown as 1603), the card-side application 1302 was altered to send batched requests 1301 to 

; 

j a modified DBS Manager (Layer 2) 1303- 1 3 04, thus reducing the number of context 

i 

switches. The card-side application 1302 calls DES for the batched operations. The modified 

i . ' ■ 

I 5 DES Manager 1303-1304 processes each request with a chip reset and a new key and IV, The 

I requests are sent to the host 1305. The results obtained using the modified DES Manager 

\ 

1303-1304 are shown as 1604 in Fig. 16. 

i 
i 

1 According to yet another embodiment of the present invention, the FIFO state 

I 

I machtaepiimpsdatab3^throughD£Sinabulkmode. Thus, the CPU 

I 10 data bytes. According to the prior methods, each byte of the cipher key, IV, and data was 

1 * . 

handled many times. The bytes came in via FIFOs and DMA into the DRAM with an initial 

sccReques t buffer transfer. The CPU takes the bytes out of DRAM and puts them into the 

DES chip. The CPU takes the data out of the DES chip and puts it back into DRAM. The 

CPU sends the data back to the host through the FIFOs. Accordingly, by reducing the number 

15 of data transfers the throughput can he increased 1605. Key uii^acking is eliminated as a 
built-in part of the API. Each application may have a imique method of unpacking, making 
. . the API unpacking redundant; Within each application an initialization stq> concludes witii a 
plain text key table resident in the device DRAM. The operation lengths were standardized to 
40 bytes. In addition, the host application was modified to generate sequences of requests 

20 that include an index into the internal key t^ble» instead of a cipher key. Thus, the card-side 
JOTlication 1401 calls the modified DES Manger 1402 and 1407 and makes the key table 
1403 and 1408 available to it, rather than immediately bringing the request sequence from 
. the PCI Input FIFO into DRAMV For each operation the modified DES Manager 1402 and 
1407 resets the DES chip; reads the IV and loads it into the chip; reads and sanity checks the 

11 
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key table, looks tip the key, aad loads it into the ciup; and reads the data length for the 
operation. The modifiled DES Manager sets up the state machine to transmit that number of 
bytes through the Input FIFOs into the DES chip then back out the Output FIFOs 1404-1406 
and 1409-1411.- The card-side application closes out the request 1412. The results are shown 
5 as 1605 in Fig. 16. 

According to an embodiment of the present invration. The number of Industry 
Standard Architecture (ISA) I/O instructions was increased (doubled) which reduced the 
throughput by half, showing a correlation between the ISA I/O instructions and the 

throughput speed. The modified DES Manager described above (with respect to 1605 and 

^ . 

10 Fig. 14) was then modified to use memory-mapping I/O ports instead of ISA I/O when 
available (the hardware used did* not provide memory mapped I/O ports for all instances). 
The software was also modified to eliminate any spurious FIFO reads caused by certain state 
machine polling intermittently. The results are shown as 1606 in Fig, 16* 

Referring to Fig. 15, by batching the parameters together, the parameters can be read 

15 via memory-mapped operations^ allowing modification of the FIFO configuration and the 
processing of the data. Layer 3 calls* £)ES for the batched operations 1501 . The host 
application batches Ihe per-bperation parameters into one group 1 503, attached to fiie input 
data. The modified DES Manager sets up the Internal FIFOs and the state machine to read Ihe 
batched parameters, by-passing the DES chip 1502; reads the batched parameters via 

20 memory-nuqpped I/O from the Internal Output FIFO mto DRAM 

reconfigures the FIFOs; and, using the buffered paiametCTS, sets up the state machine and the 

.- •• * - 

DES chip to transmit each operation's data 1506 and 1510 from the input FIFOs, through the 
DES, then back out the Output FIFOs 1505, 1507 and 1509 and 1511. Layer 3 closes out the 

12 
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request 1512. The results are shown in 1607 in Fig, 16. The accuracy of the method may be 

increased by accessing the IV and data length registers through the ISA method 1608, 

According to the present invention, the short-DES speed can be determined according 

to the following relationship: . . , 

^ Ci * Batches + C2 ■ Batches * Ops + C3 * Batches > Ops - DataLen 

Batches • C^s - DataLen 

where Batches is the number of host-card batches. Ops is the number of operations per batch, 

DataLen is the average data length per operation, and C/, C?, and C3 are unlaiown constants 

representing the per-batch per-o^ration and per-byte overheads, respectively. 

Hie present invention contemplates eliminating the per-batch overhead Ci by 

10 mddifying the host device driyiea--I^yer 2 interaction to enable indefinite sccRequest, with 
added polling or signaling to inciicate when additional data is ready for transfer. The 
per-operation overhead C2 may.]be reduced by minimizing the number of per-operation 
parameter transfers. For example, the host i5)plication may, within a batch of operations, 
interleave parameter blocks 'that assert arguments such as, the next N operations all use a 

15 particular key. This method eliminates bringing in and reading the key 

iteration. Another example can includes the host application processing the TVs before or 
after transmitting the data to the card. This is not a security issue if the host application is 
tmsted to provide the TVs. The*method eliminates bringing in the IVs and, because the DBS 
chip has a default IV of zeros after reset, eliminates loading the IVs* 

20 According to another embodiment of tiie present invention, per-operation overhead 

may be reduced by redesigning the FIFOs and the state machine. By modifying the DBS 
engine to expect data-input to mclude parameters interleaved with data, then the 
per-operation overhead C? maylipproach the per-byte overhead C?. The state machine 
handles fewer output bytes than input bytes and the CPU controls the class of engine 

13 
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operations over which the paraqieters, for example, chosen externally, are allowed to range. 
For example, the external entity .:piay be allowed to choose only certain types of encryption 
operations. Further, the CPU may insert indirection on the parameters the external entity 
chooses and the parameters the engine see, e,g., liie external entity provides an index into an 
5 internal table. 

Having described embodiments of a system and method of cryptography, it is noted 
that modifications and variations can be made by persons skilled in the art in light of the 
above teachings. It is therefore to be miderstood that changes may be made in the particular 
embodiments of the invention disclosed which are within the scope and spirit of the invention 
10 as defined by the appended claims. Having thus described the invention with the details and 
particularity required by the patent laws, what is claims and desired protected by Letters 
Patent is set. fortti in the appended claims. 



14 
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What is claimed is: 

L A symmetric key cryptographic method for short operations comprising the steps oC 
batching a plurality of operation parameters; and 

performuig an operation according to a corresponding operation parameter. 

5 

2. The method of claim 1^ wherein the symmetric key cryptographic method is a Data 
Enciyption Standard (DES) method 

3. The method of claim 1, wherein the short operations are less than about 80 bytes. 

10 

4. The method of claim 1, wherein the short operations are between 8 and 80 bytes. 

5. The method of claim 1 , further comprising the steps of: 

batching the plurality of operation parameters and a plurality of DES operation into a 
15 single request; 

calling DBS for each operation in the request; and 

performing DES for each operation separately according to the correspondiag 
operation parameter. 



20 



6. The method of claim 1, further comprising the steps of: 

batching the pteality of operation parameters and a plurality of DES operations into a 
single request; 
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calling DES for the batched operations; and 

performing DES for each operation separately according to the corresponding 
operation parameter* 

5 7, The metiiiod of claim 6, wherein each request is performed with a chip reset and a key 
and an initialization vector, 

8. The method of claim 6, wherein the step of calling the DES for the batched operations 
&rther coiaprises switching a context for the batched operations. 

10 

9. The method of claim 8, wherein the context switch is between an application layer 
and a system software layer. 

10. The method of claim 1, further comprising the steps of: 
reading the batched parameters fixnn a dynamic random access-memory; and 
transmitting each operation through a DES engine according to the operations 

parameter. 

11. A method for improved DES short operation throughput comprising the steps of: 

20 batching a plurality of operation parametOT, each operation parameter corresponding 

to an operation; 

reading the batched operation parameters into a dynamic random access memory; and 
transmittnig each op^:ation through a DES engine according to fhe operations 
parameter. 

16 
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12. The method of claim 10, wherein the PES is extemal-to-extemal and an output for 
each operation is transmitted separately. 

5 13. The mediod of claim 10, wherein the short operation is less than about 80 bytes. 

14 The method of claim 10» wherein the short operation is betwerai 8 and 80 bytes. 

15 A symmetric key cryptographic method for op^ations between about 8 and about 80 
10 bytes in length comprising the steps of: 

providing a key index to an engine; and 

pumping the operations througih the engine in bulk wherein a central processing unit 
does not handle tiie bytes. 

15 16 Themethodof claim 15 wherein the engme is a DES engine. 

17 The method of claim 15 further comprising the steps of: 
resetting an engine chip for an operation; 
readmg an initialization vector; 
20 loading the initialization vector into the engine chip; 

determining a key from the key index; 
loading the key into the engine chip; and 
reading a data length for the operation. 

17 
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j 

18 Themethodof claim 17 further comprismgtte 

i transmitting the data length through an Input channel into the engine chip; and 

i * 

I transmitting the data lengtia through an Outiyut channel. 

t 

i 

i 5 19. Th.e method of claim 18, wherein the channels are FIFOs. 
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